Automatic crosslingual thesaurus generated from the Hong Kong SAR Police Department Web corpus for crime analysis

Authors

  • Kar Wing Li
  • Christopher C. Yang
Abstract

based approach to align English/Chinese Hong Kong Police press release documents from the Web is first presented. We also introduce an algorithmic approach to generate a robust knowledge base based on statistical correlation analysis of the semantics (knowledge) embedded in the bilingual press release corpus. The research output consisted of a thesaurus-like, semantic network knowledge base, which can aid in semantics-based crosslingual information management and retrieval.

Similar resources

Automatic Construction of Cross-Lingual Networks of Concepts from the Hong Kong SAR Police Department

The tragic event of September 11 has prompted the rapid growth of attention of national security and criminal analysis. In the national security world, very large volumes of data and information are generated and gathered. Much of this data and information written in different languages and stored in different locations may be seemingly unconnected. Therefore, cross-lingual semantic interoperab...

Automatic generation of English/Chinese thesaurus based on a parallel corpus in laws

The information available in languages other than English in the World Wide Web is increasing significantly. According to a report from Computer Economics in 1999, 54% of Internet users are English speakers (“English Will Dominate Web for Only Three More Years,” Computer Economics, July 9, 1999, http://www.computereconomics.com/new4/pr/pr990610.html). However, it is predicted that there will b...

What Really Matters: Living Longer or Living Healthier; Comment on “Shanghai Rising: Health Improvements as Measured by Avoidable Mortality Since 2000”

The decline in Avoidable Mortality (AM) and increase in life expectancy in Shanghai is impressive. Gusmano and colleagues suggested that Shanghai’s improved health system has contributed significantly to this decline in AM. However, when compared to other global cities, Shanghai’s life expectancy at birth is improving as London and New York City, but has yet to surpass that of Hong Kong, Tokyo,...

English-Japanese Cross-lingual Query Expansion Using Random Indexing of Aligned Bilingual Text Data

Vector space models can be used for extracting semantically similar words from the co-occurrence statistics of words in large text data. In this paper, we report on our NTCIR 2002 experiments using the Random Indexing vector space method for extracting an English-Japanese cross-lingual thesaurus from aligned English-Japanese bilingual data. The crosslingual thesaurus has been used for automatic...

The Contribution of Ageing to Hospitalisation Days in Hong Kong: A Decomposition Analysis

Background Ageing has become a serious challenge in Hong Kong and globally. It has serious implications for health expenditure, which accounts for nearly 20% of overall government expenditure. Here we assess the contribution of ageing and related factors to hospitalisation days in Hong Kong. We used hospital discharge data from all publicly funded hospitals in Hong Kong between 2001 and 2012.  ...

Journal title:
  • JASIST

Volume 56  Issue 

Pages  -

Publication date 2005